#AI Agents

59 articles

TechMar 27, 20266 min

HyperAgents shows that improving the way you improve can transfer beyond coding

Meta AI's HyperAgents performs metacognitive self-correction that optimizes improvement strategies themselves. Self-improvement appears in four non-coding domains, and strategies learned in one domain transfer to another, along with spontaneously acquired persistent memory.

MachineLearning AI AI Agents Open Source Research

TechMar 18, 20264 min

Holotron-12B Makes PC-Operation AI 1.7× Faster, and Unsloth Studio Lets You Tune Models Without Code

H Company's Holotron-12B uses a memory-efficient new design to lift PC-operation AI throughput to 8,900 tokens per second. Unsloth has released the beta of 'Studio,' a browser tool for no-code model fine-tuning.

AI LLM AI Agents Unsloth Local LLM

TechMar 10, 20265 min

OpenAI's Promptfoo acquisition and Microsoft's shift to a multimodel stack

OpenAI acquired AI security evaluation platform Promptfoo, and Microsoft announced that Anthropic's Claude Cowork would be integrated into Microsoft 365 Copilot. The structure of the enterprise AI market is starting to change.

OpenAI Microsoft Anthropic Claude Security Copilot AI AI Agents

TechMar 10, 20267 min

Karpathy's Autoresearch lets AI run 100 ML experiments while you sleep

Andrej Karpathy released Autoresearch, a system where an AI agent autonomously runs machine-learning experiments on a GPU and tries 100 variants overnight. The article breaks down the mechanism and design so even readers with zero ML background can follow.

AI MachineLearning LLM AI Agents OSS

TechFeb 25, 20269 min

AMOS turns AI agents into a delivery vehicle via malicious OpenClaw SKILL.md on macOS

Trend Micro analyzed a new AMOS distribution method that targets AI agent workflows. A malicious SKILL.md on OpenClaw plants fake CLI install instructions and uses AI as the intermediary to manipulate people.

Security macOS AI Agents Malware Supply Chain OpenClaw

TechFeb 24, 2026updated7 min

Injection Attacks on AI Agent Memory and Automated Smart Contract Exploitation with EVMbench

Techniques and defenses from the MINJA, InjecMEM, and ToxicSkills campaigns that poison AI agents’ memory files, and the fact that GPT-5.3-Codex achieved a 72% exploit success rate on EVMbench released by OpenAI and Paradigm. This article organizes how AI becomes both a target of attacks and a weapon for attackers.

Security AI Agents Prompt Injection MCP Ethereum Smart Contracts OpenAI Supply Chain

TechFeb 23, 2026updated9 min

Design Principles for Running AI Coding Agents in Production

Stripe Minions, Amazon Kiro, Claude Code compaction, and a Replit DB deletion. We synthesize multiple cases to extract the design principles required to operate AI coding agents in production, and organize them alongside CodeRabbit's 470‑repo statistics plus efforts from Google and GitHub.

AI Agents Stripe MCP Coding Agents AWS Amazon Claude Code incidents Design Replit

TechFeb 22, 2026updated7 min

AI Agent Orchestration: Claws and Cord

Andrej Karpathy coined "Claws" as an upper layer for AI agents, and June Kim answered the same question from a different angle with the Cord framework implemented with MCP and SQLite. This piece organizes the shift from single-shot agents to autonomous coordination systems from both conceptual and implementation perspectives.

AI AI Agents MCP LLM Architecture Karpathy

TechFeb 21, 2026updated8 min

Three Failure Modes of AI Coding Tools (Production Deletion, Context Loss, Quota Exhaustion)

Kiro autonomously deleted production, causing 13 hours of AWS downtime; Claude Code’s auto-compaction irreversibly erases context; sub-agents silently burn through usage. Three incident reports from the same week.

AI AI Agents Claude Code AWS Amazon Coding Agents incidents Design

TechFeb 21, 2026updated7 min

Inside the Architecture of Stripe’s AI Coding Agent “Minions”

Stripe’s Minions agent generates 1,300+ PRs per week with zero human effort. Implementation details of the four components: Devbox, Blueprints, Toolshed, and a fork of goose.

AI Agents Stripe MCP Coding Agents Architecture

TechFeb 19, 2026updated5 min

How IT-Bench and MAST expose enterprise AI agent failure modes

Using IBM and UC Berkeley's IT-Bench benchmark and the MAST failure taxonomy, this article examines why enterprise AI agents fail. It covers the reality of 11% SRE success and 0% FinOps success, plus the Replit production database deletion incident.

AI AI Agents IBM Benchmark Enterprise